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METHOD OF POST PROCESSING OCR INFORMATION OBTAINED FROM 
MAILPIECES USING A CUSTOMER SPECIFIC KEYWORD DATABASE AND A 

MAILPIECE SORTING APPARATUS 

5 

Field of the Invention 

The invention disclosed herein relates generally to automated mail sorting and 
10 more particularly, a method of post processing addressee information using a keyword 
database to determine the intended mailpiece destination. 

Background of the Invention 

||5 The processing and handling of mailpieces consumes an enormous amount of 

^1 human and financial resources, particularly if the processing of the mailpieces is done 
manually. The processing and handling of mailpieces not only takes place at the Postal 

hk Service, but also occurs at each and every business or other site where communication 

m 

, via the mail delivery system is utilized. That is, various pieces of mail generated by a 
f^20 plurality of departments arid individuals within a company need to be addressed, 
Q collected, sorted and franked as part of the outgoing mail process. Additionally, 

incoming mail needs to be collected and sorted efficiently to ensure that it gets to the 
H addressee (i.e. employee or department) in a minimal amount of time. Since much of 

the documentation and information being conveyed through the mail system is critical in 
25 nature relative to the success of a business, it is imperative that the processing and 

handling of both the incoming and outgoing mailpieces be done efficiently and reliably 

so as not to negatively impact the functioning of the business. 

Various automated mail handling machines have been developed for processing 

incoming mail (removing individual pieces of mail from a stack and performing 
30 subsequent actions on each individual piece of mail). Generally, the mail handling 

machines separate individual mailpieces from a stack, read the mailpieces using an 

optical character recognition (OCR) system and compare the read information to an 

addressee database in order to determine the appropriate destination points for delivery 
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of the maiipieces. Some of the incoming mail received at a mailroom of the company 
can be unreadable by the OCR system, the quantity of which can be great since 
recipients cannot control the addressee format in which the incoming mail is received. 
Some of the unreadable mail could de, for example, mail which is not OCR readable 
5 "OCR rejects" (i.e. smeared or needs to be opened to determine addressee), "mystery 
mail" which mail with no particular addressee (i.e. mail addressed to a company or 
department only or mail with poor quality handwriting), or "research mail" (i.e. mail that 
can not be read by OCR but does not require opening for the operator to determine the 
addressee, including the situation where there are several potential addressees with the 
10 same name). The unreadable mail, which will be referred to generally as "reject mail" is 
expensive to process since it drains the resources of the mail room requiring additional 
time and labor for sorting and delivery. Therefore, it would be helpful if the maipieces 
Q for which an intended recipient has not been identified could be processed additionally, 
9 quickly and in an automated fashion so as not to encumber the resources of the 
fi|i5 mailroom. 

Previously, if a determination could not be made by the incoming mail handling 
^1 machine as to the addressee, a video image of the mailpiece was viewed by an 

n 

operator and in the case where the addressee image was readable by the operator, 
addressee information was keyed into the system and associated with an identification 
%o number for the mailpiece. This is typically done after the unreadable maiipieces are 
sorted into a reject bin because it requires time to make the determination and provide 
the information to the system for proper sorting. The previously rejected maiipieces are 
then resorted by reading the identification information which can be printed on the mail 
during the first sort. The identification information is linked with the addressee 
25 information manually keyed in by the operator during the reject processing / video 
coding sequence and is used to sort the mailpiece to the proper destination bin. 

Video processing of maiipieces has been performed at on-site video coding 
terminals or off-site video coding facilities where the video image is transmitted for 
determination of addressee by an operator. The information is then transferred back to 
30 the sorting apparatus. The software and hardware costs associated with video 
processing can be high because video coding requires additional computer systems, 
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image servers and workstations. Additionally, licensing fees for video coding software 
can be expensive. Video coding can also be labor intensive because the operator has 
to input information using a keyboard. While predictive keying can be used, the 
operator is still bogged down with using his or her hands to input addressee information. 
5 Thus, a separate video coding operator is needed in apart from the incoming mailpiece 
sorting apparatus operator in order to keep throughput on the sorting apparatus while 
processing rejects. 

Thus one of the problems of the prior art is that a system is not available for 
providing additional automated identification of addressees or destinations. Another 
10 problem of the prior art is that a system is not available which provides higher 
throughput and decreased labor costs. Another problem of the prior art is that is can 
be expensive. Yet another problem of the prior art is that incoming mail handling 
CI machines do not Include additional functionality for using automation to detennlne the 
intended recipient. Therefore, a method of processing mailpieces with unidentifiable 
15 addressees (rejects) is needed which integrates reject processing with the mailpiece 
fl sorting apparatus with increased read rates and with greater throughput. 
U 

hh Summary of the Invention 

tj20 This invention overcomes the disadvantages of the prior art by providing a 

Q method of processing reject mailpieces with better throughput and lower labor costs. 



.1^ 



This in turn affords quicker mailpiece processing. The present invention is directed, in 
general to automated mailpiece sorting apparatus and more particularly, a method of 
processing rejected mailpieces using an automated mailpiece sorting apparatus with a 
25 customer specific (i.e. employer / company using the mail sorting apparatus) keyword 
database and processing addressee information to increase the identification rate of the 
intended recipient(s) of mailpieces. The mailpiece sorting apparatus may generally 
comprise a feeder, a scanner, a mailpiece deliverer, compartments or bins for receiving 
sorted mailpieces, optical character recognition system (OCR) for reading addressee 
30 information, a personal computer (PC) or microprocessor based control system, 
recipient matching software and an addressee database. The mailpiece sorting 
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apparatus of the present invention also comprises a customer specific keyword 
database for post OCR processing of mailpieces. 

In an embodiment of the present invention, mailpieces are sorted by the 
mailpiece sorting apparatus during a first-pass sort (i.e. on first pass mailpieces are 
5 divided into available bins, on next pass, mailpieces from one or more bins of the first 
pass are resorted into available bins). Mailpieces that can be read and addressees 
identified using the OCR system or the post OCR processing application are sent to 
their designated sort bins. Mailpieces that the mailpiece sorting apparatus OCR system 
and post OCR processing application cannot determine the recipient for ("rejects") are 
10 sent to a reject bin. Once the first pass sorting is completed, the rejects may be 
processed using a reject sorting mode such as video coding, manual Identification or 
voice recognition system or other system determined by one of ordinary skill In the art 
Q using factors such as cost and throughput. 

]| An advantage of the method of the present invention is that it provides higher 

Wi5 throughput with minimal additional hardware, software and labor costs. Another 
ii additional advantage of the present invention is that there is higher throughput of sorted 
mailpieces. Another advantage of the present invention is that it allows for additional 
H automated processing. Other advantages of the invention will in part be obvious and 
II will in part be apparent from the specification. The aforementioned advantages are 
^|20 illustrative of the advantages of the various embodiments of the present invention. 

Description of the Drawings 

The above and other objects and advantages of the present invention will be 
25 apparent upon consideration of the following detailed description, taken in conjunction 
with accompanying drawings, in which like reference characters refer to like parts 
throughout, and in which: 

Fig. 1 is a block diagram that illustrates a computer system upon which an 
30 embodiment of the invention may be implemented. 

Fig. 2a illustrates the connection of the computer system to the sorting 
apparatus. 
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Fig. 2b is a block diagram illustrating a four bin module which may be part of the 
mailpiece sorting apparatus used to perform an embodiment of the method of the 
present invention. 

Figs. 3a-3b illustrate various reject mailpieces. 

Fig. 4a illustrates a functional block diagram of an embodiment of the method 
and apparatus of the present invention. 

Fig. 4b is an upper level flow chart illustrating the post process method of an 
5 embodiment of the present invention. 

Fig. 5 is a flowchart of an embodiment of the post processing method of in the 
present invention in a single sort pass scenario. 

Figs. 6a-b illustrate a flowchart of an embodiment of the post processing method 
of the present invention in a multiple sort pass scenario. 



j| In describing the present invention, reference will be made herein to Figs. 1-6 of 

?J the drawings in which like numerals refer to like features of the invention. Features of 
s 15 the invention are not necessarily sfiown to scale in the drawings. 



CI HARDWARE OVERVIEW 

C| Fig. 1 is a block diagram that illustrates a computer system 100 upon which an 

embodiment of the invention may be implemented. Computer system 100 may be a 
20 personal computer which is used generically and refers to present and future 
microprocessing systems with at least one processor operatively coupled to user 
interface means, such as a display 102 and keyboard 104, and/or a cursor control, such 
as a mouse or a trackball 106, and storage media 108. The personal computer 100 
may be a workstation that is accessible by more than one user. The personal computer 
25 also includes a conventional processor 110, such as a Pentium® microprocessor 
manufactured by Intel, and conventional memory devices such as hard drive 108, floppy 
drive(s) 112, and memory 114. 

The computer system 100 can be connected to a sorting apparatus 8 as 
illustrated in Fig. 2a. The mailpiece sorting apparatus 8 may generally comprise a 
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feeder 10, a line scan camera 14 (and OCR software, not shown), a mailpiece 
transporter 16, a bin module 20 (shown in Fig. 2b) with compartments or bins 18, 18' for 
receiving sorted mailpieces 30 and a control system 100 which may be the 
microprocessor based personal computer system described above. The computer 
5 system 100 can run the voice recognition software. The computer system 100 includes 
appropriate memory devices 108, 114 for storage of information such as an address 
database 22, One of ordinary skill in the art would be familiar with the general 
components of the sorting apparatus upon which the method of the present invention 
may be performed. 

10 The mailpiece sorting apparatus 8 and the OCR software may be used to 

detennine the addressee of the mailpiece 30 or other information on the face of the 
mailpiece 30. The reading of various information may be performed with the assistance 
Q of intelligent character recognition (ICR) or imaging and character recognition (OCR/IC) 
y which may be part of the above mentioned OCR software and can read the various 

44 

Pji5 fields on the mailpiece 30. 

44. 
H 

S| REJECT MAILPIECES 

hh Figs, 3a-3b illustrate various reject mailpieces 30 which can be unidentifiable by 

C j an OCR system. Post processing of the present invention can increase the read rate of 

'S 

reject mailpieces. The term "post processing" of OCR information refers to processing 
20 after the OCR system has attempted to make identification of the addressee. Figure 3a 
is an example of a reject mailpiece 30 which is unreadable by the OCR system because 
the addressee information determined by the OCR system does not match information 
in the addressee database ("OCR Reject"). However, some of the information may be 
identifiable using a customer specific keyword database (the term customer specific 
25 refers to a particular customer or business using the mailpiece sorting apparatus to sort 
mailpieces). The addressee database has various fields that contain addressee 
information including addressee name field and an addressee location field. The 
customer specific keyword database contains information relating to the fields contained 
in the addressee database. For example, the address database location field may 
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contain locations such as mailstop codes. Additional information such as, for example, 
maiistops are numeric only, receivable is in the destination name of only one 
department, there are 20 floors in the destination floor field and no 13**^ floor may also 
be included in the customer specific keyword database. 
5 For a specific customer or company, such as the example customer incoming 

mailpiece of Fig. 3a, the mailstop codes are numeric in nature (not alpha-numeric). This 
information can be contained in the customer specific keyword database. Using this 
example, if the OCR were to read a mailstop code from a mailpiece and determine that 
the code was "0S21", no match would be made to a particular addressee since the 
10 customer's mailstop codes are not alpha numeric. Since a match was not made, post 
processing is perfomned. During post processing, the mailcode is cleansed using 
customer specific keyword information. Since in this example, the customer's mailstop 
codes are numeric only, the OCR result mailstop "0S21" is cleansed and is now 
interpreted as "0521". A match can then be made to information in the addressee 
fl|i5 database. 

In another example, for the incoming mailpiece of Fig. 3b, the OCR determines 



CI 
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H from a mailpiece that the destination is "accnts receivable". No match can be found in 



U the addressee database. Using post processing, the customer specific keyword 
^ database indicates that only one department destination for the customer includes the 
^"'ho word "receivable", in this example the department is "accounts receivable", the 

addressee destination is interpreted as "accounts receivable". A match can then be 

made to information in the addressee database. 



POST PROCESSING OF ADDRESSEE INFORMATION 

The present invention is related to the use of computer system 100 connected to 
25 the mailpiece sorting apparatus 8 for performing application software methods. The 
method of the present invention is used to post process mailpieces 30 which are 
unidentifiable by the OCR system ("reject" mailpieces) in order to determine the 
appropriate addressee, intended destination or recipient. 

Figure 4a is a functional block diagram of an embodiment of the method and 
30 apparatus of the present invention illustrating the flow of a post processing application. 
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The mailpiece sorting apparatus 8 and OCR 14 pass reject mailpiece information to a 
post processing software application 34 which uses a keyword database 22' to attempt 
to determine an appropriate addressee for the previously unidentifiable addressee of 
mailpiece 30. if the addressee is identifiable using the post processing software 
application 24 and keyword database 22' then the mailpiece is sorted to a sort bin 18 for 
identifiable mailpieces. If the addressee is not identifiable using the post processing 
software application 24 and keyword database 22' then the mailpiece is sorted to a 
reject bin 18' (shown in Fig. 2b). The bins may be in one or more bin module(s) 20 
(shown in Figs, 2a-b). 

Figure 4b is an upper level flow chart illustrating the post process method of an 
embodiment of the present invention. Mailpiece sorting apparatus 8 is connected to an 
addressee database 22 and a customer specific keyword database 22'. Block 38 
illustrates customer specific data entry relating to specific keywords such as, for 
example, department names (payable, human, company specific building names) etc. 
This information is stored in database 22'. At block 40 the OCR system of mailpiece 
sorting apparatus 8 makes a comparison of information obtained by the OCR system 
with an addressee database to attempt to find an addressee match. At block 42, for 
mailpieces for which a match was not made at block 40, a comparison of information 
obtained by the OCR system is made to the customer specific keyword database. At 
block 44 a query is made as to whether an addressee match has been made . If an 
addressee match has been made, at block 46, the mailpiece(s) are delivered to 
appropriate bin(s) 18. If an addressee match has not been made, at block 48, the 
mailpiece(s) are delivered to a reject bin 18'. 

Fig. 5 is a flowchart of an embodiment of the post processing method of the 
present invention in a single sort pass scenario. At step S200 the method begins. At 
step S202 a stack of mailpieces (not shown) is placed on the feeder 10 of the mailpiece 
sorting apparatus 8. At step S204 the feeder 10 is set to auto feed and the mailpieces 
30 are moved along the feedpath the mailpiece sorting apparatus 8. At step S206 the 
mailpieces 30 are read using the OCR system. At step S207 information obtained using 
the OCR system is compared to information in a addressee database 22 of the 
mailpiece sorting apparatus 8. At step 208 a query is made as to whether the 
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addressee can be determined from the information read by the OCR system. If the 
answer to the query S208 Is yes, then the mallpiece 30 is delivered to the appropriate 
sort bin 18 at step S209. If the answer to the query S208 is no, then infomiation 
obtained by the OCR system is compared with a customer specific (the terni "customer 
5 specific" refers to a particular customer or business using the mallpiece sorting 
apparatus to sort mailpieces) keyword database 22' of the mallpiece sorting apparatus 
8. If the answer to the query S208 is no, then the information obtained using the OCR 
system is compared to information in a customer specific keyword database at step 
S210. Next, at step S211 a query is made as to whether the addressee can be 
10 determined. If at step S211 the addressee can be detennlned, then at step S212 the 
mallpiece is delivered to an appropriate sort bin. If at step S211 the appropriate 
addressee cannot be determined, then the mallpiece is delivered to a reject bin 18' at 
Q step S213. Next, a query Is made at step S214 as to whether there are additional 
j| mailpieces 30 to be processed. If the answer to the query of step S214 is yes, then 
Wi5 steps S206 through S214 are performed until no mailpieces 30 are left to be processed. 
hh It the answer to the query at S214 is no, then the method ends at S216. 
'^^ Reject mailpieces may be processed further using other methods such as, for 

1"^* example, video coding or voice recognition. Such additional processing may be 
Q determined by one of ordinary skill in the art; cost and throughput factors may be used 
'^^20 In making the detemrilnation. 

^ Figs. 6a-b illustrate a flowchart of an embodiment of the post processing method 

of the present invention in a multiple sort pass scenario. At step S300 the method 
begins. At step S302 a stack of mailpieces (not shown) is placed on the feeder 10 of 
the mallpiece sorting apparatus 8. At step S304 the feeder 10 is set to auto feed and 
25 the mailpieces 30 are moved along the feedpath the mallpiece sorting apparatus 8. At 
step S306 the mailpieces 30 are read using the OCR system. At step S307 a code is 
printed on the mallpiece 30 for second pass sorting and / or reject processing. At step 
S308 information obtained using the OCR system is compared to infomnation In a 
addressee database 22 of the mailpiece sorting apparatus 8. At step S310 a query is 
30 made as to whether the addressee can be determined from the information read by the 
OCR system. If the answer to the query S310 is yes, then the mailpiece 30 is delivered 
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to the appropriate sort bin 18 at step S312. If the answer to the query S310 is no, then 
information obtained by the OCR system is compared with a customer specific keyword 
database 22' of the mailpiece sorting apparatus 8 at step S314. Next, at step S316 a 
query is made as to whether the addressee can be determined. If at step S316 the 
addressee can be detennined, then at step S318 the mailpiece is delivered to an 
appropriate sort bin. If at step S316 the appropriate addressee cannot be determined, 
then the mailpiece is delivered to a reject bin 18' at step S320. Next, a query is made 
at step S322 as to whether there are additional mailpieces 30 to be processed. If the 
answer to the query of step S322 is yes, then steps S306 through S322 are performed 
until no mailpieces 30 are left to be processed. 

If the answer to the query at S322 is no, then the method proceeds with a second 
pass sort for previously identifiable mailpieces sorted to bin(s) 18. Mailpieces which 
were sorted to reject bin 18' can be further processed using a technique such as video 
coding or voice recognition explained above, prior to second pass sorting performed in 
steps S324-S334. 

At step S324 the feeder 10 is set to automatic feed. At step S326 the mailpieces 
30 are removed from the bins 18, 18'. At step S328 the mailpieces 30 which were 
removed from one or more bins 18 are placed on the feeder 10 for a second pass sort 
The determination as to which mailpiece 30 are run through the second pass sort is 
made by the operator and the equipment in coordination with a iM^evlously detennined 
sort scheme which may be determined by one of ordinary skill in the art. At step S330 
the OCR reads the mailpiece ID 32 (shown in Fig. 3b) on the mailpieces (the mailpiece 
ID 32 was printed on the mailpiece during the first pass sort at step S307). At step 
S332 the mailpiece sorting apparatus 8 determines the appropriate bin 18 for delivery of 
the mailpiece. At step S334 the mailpiece 30 is delivered to the appropriate bin 18, 18'. 
The operation continues until all mailpieces are sorted. At step S336, the method ends. 

An additional feature of the present invention tracks and calculates statistical 
infomnation regarding the rejects, the number of mailpieces successfully post processed 
and the total number of rejects as compared to the total number of mailpieces sorted. 
The mailpiece sorting apparatus could generate report(s) detailing counts of the 
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different types of rejects and post processed rejects as well as operator productivity 
measurements. 

The present invention provides for better throughput and decreased sorting 
costs. It provides the ability to customize OCR decision making without requiring 
changes to the general OCR decision making method. It further provides the ability to 
adapt the post processing database over time to provide increased sort rates as the 
customers database information changes. While the present invention has been 
disclosed and described with reference to a single embodiment thereof, it will be 
apparent, as noted above that variations and modifications may be made therein. It is, 
thus, intended in the following claims to cover each variation and modification that falls 
within the tme spirit and scope of the present invention. 
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